Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 13349 |
| Missing cells | 3952 |
| Missing cells (%) | 1.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.4 MiB |
| Average record size in memory | 191.4 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 5 |
Year has constant value "2019" | Constant |
Duration has a high cardinality: 373 distinct values | High cardinality |
Source is highly correlated with Destination | High correlation |
Destination is highly correlated with Source | High correlation |
Total_Stops is highly correlated with Price and 1 other fields | High correlation |
Price is highly correlated with Total_Stops and 1 other fields | High correlation |
duration_hour is highly correlated with Total_Stops and 1 other fields | High correlation |
Source is highly correlated with Destination | High correlation |
Destination is highly correlated with Source | High correlation |
Total_Stops is highly correlated with Price and 1 other fields | High correlation |
Price is highly correlated with Total_Stops and 1 other fields | High correlation |
duration_hour is highly correlated with Total_Stops and 1 other fields | High correlation |
Source is highly correlated with Destination | High correlation |
Destination is highly correlated with Source | High correlation |
Total_Stops is highly correlated with Price and 1 other fields | High correlation |
Price is highly correlated with Total_Stops and 1 other fields | High correlation |
duration_hour is highly correlated with Total_Stops and 1 other fields | High correlation |
Source is highly correlated with Year | High correlation |
Month is highly correlated with Year | High correlation |
Total_Stops is highly correlated with Year | High correlation |
Year is highly correlated with Source and 2 other fields | High correlation |
Airline is highly correlated with Source and 6 other fields | High correlation |
Source is highly correlated with Airline and 4 other fields | High correlation |
Destination is highly correlated with Source and 3 other fields | High correlation |
Total_Stops is highly correlated with Airline and 3 other fields | High correlation |
Additional_Info is highly correlated with Airline and 1 other fields | High correlation |
Price is highly correlated with Airline and 1 other fields | High correlation |
Month is highly correlated with Destination | High correlation |
Arrival_hour is highly correlated with Airline and 2 other fields | High correlation |
Arrival_min is highly correlated with Airline and 1 other fields | High correlation |
Dept_hour is highly correlated with Arrival_hour | High correlation |
duration_hour is highly correlated with Airline and 3 other fields | High correlation |
duration_min is highly correlated with Source | High correlation |
Price has 2669 (20.0%) missing values | Missing |
duration_min has 1282 (9.6%) missing values | Missing |
Airline has 405 (3.0%) zeros | Zeros |
Destination has 3581 (26.8%) zeros | Zeros |
Arrival_hour has 411 (3.1%) zeros | Zeros |
Arrival_min has 1827 (13.7%) zeros | Zeros |
Dept_min has 2590 (19.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-11 08:41:45.420689 |
|---|---|
| Analysis finished | 2022-04-11 08:42:38.663796 |
| Duration | 53.24 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
df_index
Real number (ℝ≥0)
| Distinct | 10680 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4540.556596 |
| Minimum | 0 |
|---|---|
| Maximum | 10682 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 104.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 334.4 |
| Q1 | 1669 |
| median | 4007 |
| Q3 | 7345 |
| 95-th percentile | 10014.6 |
| Maximum | 10682 |
| Range | 10682 |
| Interquartile range (IQR) | 5676 |
Descriptive statistics
| Standard deviation | 3208.701447 |
|---|---|
| Coefficient of variation (CV) | 0.7066757962 |
| Kurtosis | -1.22238653 |
| Mean | 4540.556596 |
| Median Absolute Deviation (MAD) | 2671 |
| Skewness | 0.3320458831 |
| Sum | 60611890 |
| Variance | 10295764.98 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1774 | 2 | < 0.1% |
| 1776 | 2 | < 0.1% |
| 1777 | 2 | < 0.1% |
| 1778 | 2 | < 0.1% |
| 1779 | 2 | < 0.1% |
| 1780 | 2 | < 0.1% |
| 1781 | 2 | < 0.1% |
| 1782 | 2 | < 0.1% |
| 1783 | 2 | < 0.1% |
| Other values (10670) | 13329 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 2 | |
| 3 | 2 | |
| 4 | 2 | |
| 5 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 8 | 2 | |
| 9 | 2 | |
| 10 | 2 |
| Value | Count | Frequency (%) |
| 10682 | 1 | |
| 10681 | 1 | |
| 10680 | 1 | |
| 10679 | 1 | |
| 10678 | 1 | |
| 10677 | 1 | |
| 10676 | 1 | |
| 10675 | 1 | |
| 10674 | 1 | |
| 10673 | 1 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.977526406 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 405 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.364158661 |
|---|---|
| Coefficient of variation (CV) | 0.5943791239 |
| Kurtosis | 0.3292713726 |
| Mean | 3.977526406 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.7207852624 |
| Sum | 53096 |
| Variance | 5.589246173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 4743 | |
| 3 | 2564 | |
| 1 | 2190 | |
| 6 | 1543 | 11.6% |
| 8 | 1026 | 7.7% |
| 10 | 608 | 4.6% |
| 0 | 405 | 3.0% |
| 2 | 240 | 1.8% |
| 7 | 16 | 0.1% |
| 5 | 8 | 0.1% |
| Other values (2) | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 405 | 3.0% |
| 1 | 2190 | |
| 2 | 240 | 1.8% |
| 3 | 2564 | |
| 4 | 4743 | |
| 5 | 8 | 0.1% |
| 6 | 1543 | 11.6% |
| 7 | 16 | 0.1% |
| 8 | 1026 | 7.7% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 5 | < 0.1% |
| 10 | 608 | 4.6% |
| 9 | 1 | < 0.1% |
| 8 | 1026 | 7.7% |
| 7 | 16 | 0.1% |
| 6 | 1543 | 11.6% |
| 5 | 8 | 0.1% |
| 4 | 4743 | |
| 3 | 2564 | |
| 2 | 240 | 1.8% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.2 KiB |
| 2 | |
|---|---|
| 3 | |
| 0 | |
| 4 | |
| 1 | 456 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 0 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 5679 | |
| 3 | 3581 | |
| 0 | 2752 | |
| 4 | 881 | 6.6% |
| 1 | 456 | 3.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 5679 | |
| 3 | 3581 | |
| 0 | 2752 | |
| 4 | 881 | 6.6% |
| 1 | 456 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Destination
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.435313507 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 3581 |
| Zeros (%) | 26.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.473504718 |
|---|---|
| Coefficient of variation (CV) | 1.026608271 |
| Kurtosis | 0.6480054839 |
| Mean | 1.435313507 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.248116424 |
| Sum | 19160 |
| Variance | 2.171216153 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5679 | |
| 0 | 3581 | |
| 2 | 1582 | 11.9% |
| 5 | 1170 | 8.8% |
| 3 | 881 | 6.6% |
| 4 | 456 | 3.4% |
| Value | Count | Frequency (%) |
| 0 | 3581 | |
| 1 | 5679 | |
| 2 | 1582 | 11.9% |
| 3 | 881 | 6.6% |
| 4 | 456 | 3.4% |
| 5 | 1170 | 8.8% |
| Value | Count | Frequency (%) |
| 5 | 1170 | 8.8% |
| 4 | 456 | 3.4% |
| 3 | 881 | 6.6% |
| 2 | 1582 | 11.9% |
| 1 | 5679 | |
| 0 | 3581 |
| Distinct | 373 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 821.5 KiB |
| 2h 50m | 672 |
|---|---|
| 1h 30m | 493 |
| 2h 45m | 432 |
| 2h 55m | 418 |
| 2h 35m | 399 |
| Other values (368) |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.004869279 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 24 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2h 50m |
|---|---|
| 2nd row | 7h 25m |
| 3rd row | 5h 25m |
| 4th row | 4h 45m |
| 5th row | 2h 25m |
Common Values
| Value | Count | Frequency (%) |
| 2h 50m | 672 | 5.0% |
| 1h 30m | 493 | 3.7% |
| 2h 45m | 432 | 3.2% |
| 2h 55m | 418 | 3.1% |
| 2h 35m | 399 | 3.0% |
| 3h | 333 | 2.5% |
| 2h 20m | 286 | 2.1% |
| 2h 30m | 278 | 2.1% |
| 2h 40m | 196 | 1.5% |
| 2h 15m | 164 | 1.2% |
| Other values (363) | 9678 |
Length
| Value | Count | Frequency (%) |
| 2h | 2967 | 11.7% |
| 30m | 1818 | 7.2% |
| 20m | 1260 | 5.0% |
| 50m | 1205 | 4.7% |
| 45m | 1153 | 4.5% |
| 35m | 1149 | 4.5% |
| 15m | 1135 | 4.5% |
| 55m | 1121 | 4.4% |
| 25m | 1009 | 4.0% |
| 40m | 803 | 3.2% |
| Other values (44) | 11796 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Total_Stops
Categorical
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 782.3 KiB |
| 1.0 | |
|---|---|
| 0.0 | |
| 2.0 | |
| 3.0 | 56 |
| 4.0 | 2 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 7055 | |
| 0.0 | 4340 | |
| 2.0 | 1895 | 14.2% |
| 3.0 | 56 | 0.4% |
| 4.0 | 2 | < 0.1% |
| (Missing) | 1 | < 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1.0 | 7055 | |
| 0.0 | 4340 | |
| 2.0 | 1895 | 14.2% |
| 3.0 | 56 | 0.4% |
| 4.0 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.407745899 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 20 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 8 |
| Q3 | 8 |
| 95-th percentile | 8 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.198391911 |
|---|---|
| Coefficient of variation (CV) | 0.1617755155 |
| Kurtosis | 2.397648133 |
| Mean | 7.407745899 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -1.784859053 |
| Sum | 98886 |
| Variance | 1.436143173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 10489 | |
| 5 | 2425 | 18.2% |
| 7 | 396 | 3.0% |
| 0 | 20 | 0.1% |
| 4 | 8 | 0.1% |
| 3 | 5 | < 0.1% |
| 6 | 3 | < 0.1% |
| 1 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 20 | 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 5 | < 0.1% |
| 4 | 8 | 0.1% |
| 5 | 2425 | 18.2% |
| 6 | 3 | < 0.1% |
| 7 | 396 | 3.0% |
| 8 | 10489 | |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 8 | 10489 | |
| 7 | 396 | 3.0% |
| 6 | 3 | < 0.1% |
| 5 | 2425 | 18.2% |
| 4 | 8 | 0.1% |
| 3 | 5 | < 0.1% |
| 2 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 0 | 20 | 0.1% |
| Distinct | 1870 |
|---|---|
| Distinct (%) | 17.5% |
| Missing | 2669 |
| Missing (%) | 20.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9085.449906 |
| Minimum | 1759 |
|---|---|
| Maximum | 79512 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 104.4 KiB |
Quantile statistics
| Minimum | 1759 |
|---|---|
| 5-th percentile | 3543 |
| Q1 | 5277 |
| median | 8372 |
| Q3 | 12373 |
| 95-th percentile | 15764 |
| Maximum | 79512 |
| Range | 77753 |
| Interquartile range (IQR) | 7096 |
Descriptive statistics
| Standard deviation | 4610.904239 |
|---|---|
| Coefficient of variation (CV) | 0.5075042278 |
| Kurtosis | 13.3157598 |
| Mean | 9085.449906 |
| Median Absolute Deviation (MAD) | 3382 |
| Skewness | 1.813937941 |
| Sum | 97032605 |
| Variance | 21260437.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10262 | 258 | 1.9% |
| 10844 | 212 | 1.6% |
| 7229 | 162 | 1.2% |
| 4804 | 160 | 1.2% |
| 4823 | 131 | 1.0% |
| 14714 | 109 | 0.8% |
| 3943 | 104 | 0.8% |
| 15129 | 93 | 0.7% |
| 3841 | 91 | 0.7% |
| 12898 | 86 | 0.6% |
| Other values (1860) | 9274 | |
| (Missing) | 2669 | 20.0% |
| Value | Count | Frequency (%) |
| 1759 | 4 | < 0.1% |
| 1840 | 1 | < 0.1% |
| 1965 | 36 | |
| 2017 | 35 | |
| 2050 | 10 | 0.1% |
| 2071 | 6 | < 0.1% |
| 2175 | 7 | 0.1% |
| 2227 | 40 | |
| 2228 | 9 | 0.1% |
| 2385 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 79512 | 1 | < 0.1% |
| 62427 | 1 | < 0.1% |
| 57209 | 1 | < 0.1% |
| 54826 | 3 | |
| 52285 | 1 | < 0.1% |
| 52229 | 1 | < 0.1% |
| 46490 | 1 | < 0.1% |
| 36983 | 1 | < 0.1% |
| 36235 | 2 | |
| 35185 | 1 | < 0.1% |
Date
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.39036632 |
| Minimum | 1 |
|---|---|
| Maximum | 27 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 12 |
| Q3 | 21 |
| 95-th percentile | 27 |
| Maximum | 27 |
| Range | 26 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.44003865 |
|---|---|
| Coefficient of variation (CV) | 0.6303067779 |
| Kurtosis | -1.25973729 |
| Mean | 13.39036632 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.1349670303 |
| Sum | 178748 |
| Variance | 71.23425241 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 1768 | |
| 6 | 1626 | |
| 21 | 1367 | |
| 27 | 1350 | |
| 1 | 1349 | |
| 24 | 1307 | |
| 15 | 1251 | |
| 12 | 1212 | |
| 3 | 1083 | |
| 18 | 1036 |
| Value | Count | Frequency (%) |
| 1 | 1349 | |
| 3 | 1083 | |
| 6 | 1626 | |
| 9 | 1768 | |
| 12 | 1212 | |
| 15 | 1251 | |
| 18 | 1036 | |
| 21 | 1367 | |
| 24 | 1307 | |
| 27 | 1350 |
| Value | Count | Frequency (%) |
| 27 | 1350 | |
| 24 | 1307 | |
| 21 | 1367 | |
| 18 | 1036 | |
| 15 | 1251 | |
| 12 | 1212 | |
| 9 | 1768 | |
| 6 | 1626 | |
| 3 | 1083 | |
| 1 | 1349 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 756.2 KiB |
| 5 | |
|---|---|
| 6 | |
| 3 | |
| 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 3 |
| 5th row | 6 |
Common Values
| Value | Count | Frequency (%) |
| 5 | 4328 | |
| 6 | 4284 | |
| 3 | 3410 | |
| 4 | 1327 | 9.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 5 | 4328 | |
| 6 | 4284 | |
| 3 | 3410 | |
| 4 | 1327 | 9.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 795.3 KiB |
| 2019 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2019 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2019 |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 13349 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2019 | 13349 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.39605963 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 411 |
| Zeros (%) | 3.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 8 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 6.896702582 |
|---|---|
| Coefficient of variation (CV) | 0.5148306869 |
| Kurtosis | -1.077425526 |
| Mean | 13.39605963 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.3844995427 |
| Sum | 178824 |
| Variance | 47.5645065 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 2056 | |
| 12 | 1094 | 8.2% |
| 4 | 1012 | 7.6% |
| 21 | 898 | 6.7% |
| 22 | 837 | 6.3% |
| 1 | 688 | 5.2% |
| 18 | 640 | 4.8% |
| 23 | 608 | 4.6% |
| 8 | 594 | 4.4% |
| 10 | 593 | 4.4% |
| Other values (14) | 4329 |
| Value | Count | Frequency (%) |
| 0 | 411 | |
| 1 | 688 | |
| 2 | 92 | 0.7% |
| 3 | 61 | 0.5% |
| 4 | 1012 | |
| 5 | 95 | 0.7% |
| 6 | 64 | 0.5% |
| 7 | 518 | |
| 8 | 594 | |
| 9 | 591 |
| Value | Count | Frequency (%) |
| 23 | 608 | 4.6% |
| 22 | 837 | |
| 21 | 898 | |
| 20 | 489 | 3.7% |
| 19 | 2056 | |
| 18 | 640 | 4.8% |
| 17 | 242 | 1.8% |
| 16 | 447 | 3.3% |
| 15 | 222 | 1.7% |
| 14 | 360 | 2.7% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.66064874 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 1827 |
| Zeros (%) | 13.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 10 |
| median | 25 |
| Q3 | 35 |
| 95-th percentile | 50 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 16.5570425 |
|---|---|
| Coefficient of variation (CV) | 0.6713952529 |
| Kurtosis | -1.038536052 |
| Mean | 24.66064874 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.1117482913 |
| Sum | 329195 |
| Variance | 274.1356562 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1827 | |
| 15 | 1612 | |
| 25 | 1598 | |
| 35 | 1364 | |
| 20 | 1106 | |
| 30 | 1062 | |
| 50 | 935 | |
| 45 | 889 | |
| 5 | 839 | |
| 40 | 785 | |
| Other values (2) | 1332 |
| Value | Count | Frequency (%) |
| 0 | 1827 | |
| 5 | 839 | |
| 10 | 717 | 5.4% |
| 15 | 1612 | |
| 20 | 1106 | |
| 25 | 1598 | |
| 30 | 1062 | |
| 35 | 1364 | |
| 40 | 785 | |
| 45 | 889 |
| Value | Count | Frequency (%) |
| 55 | 615 | 4.6% |
| 50 | 935 | |
| 45 | 889 | |
| 40 | 785 | |
| 35 | 1364 | |
| 30 | 1062 | |
| 25 | 1598 | |
| 20 | 1106 | |
| 15 | 1612 | |
| 10 | 717 |
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.51277249 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 51 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 11 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.736752484 |
|---|---|
| Coefficient of variation (CV) | 0.4584717326 |
| Kurtosis | -1.197785085 |
| Mean | 12.51277249 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.1092454768 |
| Sum | 167033 |
| Variance | 32.91032906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 1150 | 8.6% |
| 7 | 1067 | 8.0% |
| 8 | 872 | 6.5% |
| 6 | 863 | 6.5% |
| 17 | 847 | 6.3% |
| 20 | 826 | 6.2% |
| 5 | 776 | 5.8% |
| 11 | 714 | 5.3% |
| 19 | 709 | 5.3% |
| 10 | 677 | 5.1% |
| Other values (14) | 4848 |
| Value | Count | Frequency (%) |
| 0 | 51 | 0.4% |
| 1 | 44 | 0.3% |
| 2 | 228 | 1.7% |
| 3 | 30 | 0.2% |
| 4 | 219 | 1.6% |
| 5 | 776 | |
| 6 | 863 | |
| 7 | 1067 | |
| 8 | 872 | |
| 9 | 1150 |
| Value | Count | Frequency (%) |
| 23 | 189 | 1.4% |
| 22 | 486 | |
| 21 | 625 | |
| 20 | 826 | |
| 19 | 709 | |
| 18 | 553 | |
| 17 | 847 | |
| 16 | 602 | |
| 15 | 431 | |
| 14 | 647 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.50333358 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 2590 |
| Zeros (%) | 19.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 25 |
| Q3 | 40 |
| 95-th percentile | 55 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 18.8329269 |
|---|---|
| Coefficient of variation (CV) | 0.7685863164 |
| Kurtosis | -1.304691908 |
| Mean | 24.50333358 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.1596940484 |
| Sum | 327095 |
| Variance | 354.6791356 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2590 | |
| 30 | 1491 | |
| 55 | 1332 | |
| 45 | 1106 | |
| 10 | 1099 | |
| 5 | 951 | 7.1% |
| 15 | 875 | 6.6% |
| 25 | 863 | 6.5% |
| 20 | 819 | 6.1% |
| 35 | 812 | 6.1% |
| Other values (2) | 1411 |
| Value | Count | Frequency (%) |
| 0 | 2590 | |
| 5 | 951 | 7.1% |
| 10 | 1099 | |
| 15 | 875 | 6.6% |
| 20 | 819 | 6.1% |
| 25 | 863 | 6.5% |
| 30 | 1491 | |
| 35 | 812 | 6.1% |
| 40 | 646 | 4.8% |
| 45 | 1106 |
| Value | Count | Frequency (%) |
| 55 | 1332 | |
| 50 | 765 | |
| 45 | 1106 | |
| 40 | 646 | |
| 35 | 812 | |
| 30 | 1491 | |
| 25 | 863 | |
| 20 | 819 | |
| 15 | 875 | |
| 10 | 1099 |
| Distinct | 43 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.23335081 |
| Minimum | 1 |
|---|---|
| Maximum | 47 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 52.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 8 |
| Q3 | 15 |
| 95-th percentile | 26 |
| Maximum | 47 |
| Range | 46 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 8.472646818 |
|---|---|
| Coefficient of variation (CV) | 0.8279445289 |
| Kurtosis | -0.144615562 |
| Mean | 10.23335081 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.8580624796 |
| Sum | 136605 |
| Variance | 71.7857441 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 2967 | |
| 1 | 785 | 5.9% |
| 3 | 627 | 4.7% |
| 5 | 608 | 4.6% |
| 7 | 600 | 4.5% |
| 9 | 551 | 4.1% |
| 12 | 538 | 4.0% |
| 8 | 531 | 4.0% |
| 13 | 516 | 3.9% |
| 11 | 467 | 3.5% |
| Other values (33) | 5159 |
| Value | Count | Frequency (%) |
| 1 | 785 | 5.9% |
| 2 | 2967 | |
| 3 | 627 | 4.7% |
| 4 | 278 | 2.1% |
| 5 | 608 | 4.6% |
| 6 | 442 | 3.3% |
| 7 | 600 | 4.5% |
| 8 | 531 | 4.0% |
| 9 | 551 | 4.1% |
| 10 | 459 | 3.4% |
| Value | Count | Frequency (%) |
| 47 | 2 | < 0.1% |
| 42 | 2 | < 0.1% |
| 41 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 39 | 3 | < 0.1% |
| 38 | 41 | |
| 37 | 22 | |
| 36 | 11 | 0.1% |
| 35 | 10 | 0.1% |
| 34 | 9 | 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1282 |
| Missing (%) | 9.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.35990718 |
| Minimum | 5 |
|---|---|
| Maximum | 55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 104.4 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 20 |
| median | 30 |
| Q3 | 45 |
| 95-th percentile | 55 |
| Maximum | 55 |
| Range | 50 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.88542329 |
|---|---|
| Coefficient of variation (CV) | 0.4746641373 |
| Kurtosis | -1.059999682 |
| Mean | 31.35990718 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | -0.03320228296 |
| Sum | 378420 |
| Variance | 221.5758265 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 1818 | |
| 20 | 1260 | |
| 50 | 1205 | |
| 45 | 1153 | |
| 35 | 1149 | |
| 15 | 1135 | |
| 55 | 1121 | |
| 25 | 1009 | |
| 40 | 803 | |
| 5 | 767 | |
| (Missing) | 1282 |
| Value | Count | Frequency (%) |
| 5 | 767 | |
| 10 | 647 | 4.8% |
| 15 | 1135 | |
| 20 | 1260 | |
| 25 | 1009 | |
| 30 | 1818 | |
| 35 | 1149 | |
| 40 | 803 | |
| 45 | 1153 | |
| 50 | 1205 |
| Value | Count | Frequency (%) |
| 55 | 1121 | |
| 50 | 1205 | |
| 45 | 1153 | |
| 40 | 803 | |
| 35 | 1149 | |
| 30 | 1818 | |
| 25 | 1009 | |
| 20 | 1260 | |
| 15 | 1135 | |
| 10 | 647 | 4.8% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | Airline | Source | Destination | Duration | Total_Stops | Additional_Info | Price | Date | Month | Year | Arrival_hour | Arrival_min | Dept_hour | Dept_min | duration_hour | duration_min | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 3 | 0 | 5 | 2h 50m | 0.0 | 8 | 3897.0 | 24 | 3 | 2019 | 1 | 10 | 22 | 20 | 2 | 50 |
| 1 | 1 | 1 | 3 | 0 | 7h 25m | 2.0 | 8 | 7662.0 | 1 | 5 | 2019 | 13 | 15 | 5 | 50 | 7 | 25 |
| 2 | 3 | 3 | 3 | 0 | 5h 25m | 1.0 | 8 | 6218.0 | 12 | 5 | 2019 | 23 | 30 | 18 | 5 | 5 | 25 |
| 3 | 4 | 3 | 0 | 5 | 4h 45m | 1.0 | 8 | 13302.0 | 1 | 3 | 2019 | 21 | 35 | 16 | 50 | 4 | 45 |
| 4 | 5 | 8 | 3 | 0 | 2h 25m | 0.0 | 8 | 3873.0 | 24 | 6 | 2019 | 11 | 25 | 9 | 0 | 2 | 25 |
| 5 | 6 | 4 | 0 | 5 | 15h 30m | 1.0 | 5 | 11087.0 | 12 | 3 | 2019 | 10 | 25 | 18 | 55 | 15 | 30 |
| 6 | 7 | 4 | 0 | 5 | 21h 5m | 1.0 | 8 | 22270.0 | 1 | 3 | 2019 | 5 | 5 | 8 | 0 | 21 | 5 |
| 7 | 8 | 4 | 0 | 5 | 25h 30m | 1.0 | 5 | 11087.0 | 12 | 3 | 2019 | 10 | 25 | 8 | 55 | 25 | 30 |
| 8 | 9 | 6 | 2 | 1 | 7h 50m | 1.0 | 8 | 8625.0 | 27 | 5 | 2019 | 19 | 15 | 11 | 25 | 7 | 50 |
| 9 | 10 | 1 | 2 | 1 | 13h 15m | 1.0 | 8 | 8907.0 | 1 | 6 | 2019 | 23 | 0 | 9 | 45 | 13 | 15 |
Last rows
| df_index | Airline | Source | Destination | Duration | Total_Stops | Additional_Info | Price | Date | Month | Year | Arrival_hour | Arrival_min | Dept_hour | Dept_min | duration_hour | duration_min | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 13339 | 2661 | 4 | 2 | 1 | 33h 15m | 2.0 | 8 | NaN | 27 | 3 | 2019 | 4 | 25 | 19 | 10 | 33 | 15 |
| 13340 | 2662 | 1 | 4 | 3 | 1h 30m | 0.0 | 8 | NaN | 21 | 5 | 2019 | 15 | 25 | 13 | 55 | 1 | 30 |
| 13341 | 2663 | 2 | 3 | 0 | 8h 15m | 1.0 | 8 | NaN | 12 | 5 | 2019 | 7 | 45 | 23 | 30 | 8 | 15 |
| 13342 | 2664 | 6 | 2 | 1 | 10h 15m | 1.0 | 8 | NaN | 15 | 6 | 2019 | 1 | 30 | 15 | 15 | 10 | 15 |
| 13343 | 2665 | 8 | 4 | 3 | 1h 30m | 0.0 | 7 | NaN | 21 | 6 | 2019 | 0 | 15 | 22 | 45 | 1 | 30 |
| 13344 | 2666 | 1 | 3 | 0 | 23h 55m | 1.0 | 8 | NaN | 6 | 6 | 2019 | 20 | 25 | 20 | 30 | 23 | 55 |
| 13345 | 2667 | 3 | 3 | 0 | 2h 35m | 0.0 | 8 | NaN | 27 | 3 | 2019 | 16 | 55 | 14 | 20 | 2 | 35 |
| 13346 | 2668 | 4 | 2 | 1 | 6h 35m | 1.0 | 8 | NaN | 6 | 3 | 2019 | 4 | 25 | 21 | 50 | 6 | 35 |
| 13347 | 2669 | 1 | 2 | 1 | 15h 15m | 1.0 | 8 | NaN | 6 | 3 | 2019 | 19 | 15 | 4 | 0 | 15 | 15 |
| 13348 | 2670 | 6 | 2 | 1 | 14h 20m | 1.0 | 8 | NaN | 15 | 6 | 2019 | 19 | 15 | 4 | 55 | 14 | 20 |